Abstract. We consider multilinear averages in ergodic theory and harmonic analysis
and prove their divergence in some range of $L^p$ spaces. This contrasts with the positive
behavior exhibited by these averages in a different range, as proved in [5]. We also prove
that the trilinear Hilbert transform is unbounded in a similar range of $L^p$ spaces. The
underlying principle behind these constructions is stated, setting up the stage for more
general results.



1. Introduction 




Multilinear ergodic averages have proved to be a powerful tool in settling problems from 
combinatorial number theory. This strategy was initiated by Furstenberg in [6]. He gave 
an ergodic theoretical proof of a theorem of Szemeredi about the existence of arbitrarily 
long arithmetic progressions in subsets of integers with positive upper density. 

Let $X = (X, \Sigma, m, T)$ be a dynamical system, i.e. a complete probability space
$(X, \Sigma, m)$ endowed with an invertible transformation $T : X \to X$ such that $mT^{-1} = m$.
Furstenberg's method consisted of first proving that

$$\liminf_{N-M\to\infty} \frac{1}{N-M} \sum_{l=M+1}^{N} \int f_1(T^{l}x)\, f_2(T^{2l}x) \cdots f_n(T^{nl}x)\, dm(x) > 0,$$

for all positive nonzero functions $f_i \in L^\infty(X)$, and then using a correspondence principle
to transfer information to the integers. Subsequently, other combinatorial averages have
been investigated by various authors.

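Let now $n > 1$, $m \ge 1$ and consider the $(n-1) \times m$ matrix $A = (a_{i,j})_{i=1,j=1}^{n-1,m}$ with integer
entries. We will consider the averages

$$\frac{1}{N^m} \sum_{1 \le n_1,\dots,n_m \le N} \prod_{i=1}^{n-1} f_i\Big(T^{\sum_{j=1}^{m} a_{i,j} n_j} x\Big), \qquad (1)$$

where $f_1, \dots, f_{n-1}$ are measurable functions on $X$. When $m = 1$ and $a_{i,1} = i$, we obtain
the aforementioned Furstenberg averages

$$\frac{1}{N} \sum_{1 \le k \le N} \prod_{i=1}^{n-1} f_i(T^{ik} x). \qquad (2)$$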

A related object of interest is given by the averages on $m$ dimensional cubes. Some
version of them played a key role in Gowers' proof of Szemerédi's theorem [7]. They
correspond to the case where $n = 2^m$ and $A$ is a matrix having in each row a vector
from $V_m = \{0,1\}^m \setminus \{0\}$. More precisely, if $\epsilon_1, \dots, \epsilon_{2^m-1}$ is the enumeration of $V_m$ in
lexicographically increasing order, then the averages on $m$ dimensional cubes are

$$K_m(f_{\epsilon_1}, \dots, f_{\epsilon_{2^m-1}}, N)(x) = \frac{1}{N^m} \sum_{i \in \{1,2,\dots,N\}^m} \prod_{\epsilon \in V_m} f_\epsilon(T^{i\cdot\epsilon} x), \qquad (3)$$

where $i \cdot \epsilon$ denotes the usual dot product.

Both the averages in (2) and those in (3) have been shown to converge in the $L^2$-norm
when all the functions $f_i$ are in $L^\infty(X)$, a result due to Host and Kra [9] (a different proof
for the averages in (2) is due to Ziegler [14]).

As for the almost everywhere convergence of (1), there are two interesting lines of
investigation. On the one hand, it is legitimate to ask whether convergence holds for $L^\infty$
functions. Bourgain proved in [3], using Fourier analysis on the torus, that the averages
in (2) converge almost everywhere for $L^\infty$ functions in the case $n = 3$. For larger $n$ only
partial results are known, for transformations $T$ with nice spectral properties [2], [13].
Assani [1] used the semi-norms introduced by Host and Kra in [9] to prove the almost
everywhere convergence of the averages on cubes for arbitrary $m$, again for bounded
functions.

Once pointwise convergence is established for a dense class of functions (like $L^\infty(X)$),
the extension of this convergence to other $L^p$ spaces becomes synonymous with the
existence of a maximal inequality for the associated maximal operator. In [5], a very general
result is proved, showing the boundedness of the maximal operator

$$T^*_X(f_1, \dots, f_{n-1})(x) = \sup_{N \ge 1} \frac{1}{N^m} \sum_{|n_1|,\dots,|n_m| \le N} \prod_{i=1}^{n-1} \Big|f_i\Big(T^{\sum_{j=1}^{m} a_{i,j} n_j} x\Big)\Big|$$

in some range of $L^p$ spaces depending on the rank properties of the extended matrix $E(A)$
defined as

$$E(A) = \begin{pmatrix} a_{1,1} & a_{1,2} & \cdots & a_{1,m} & 1 \\ a_{2,1} & a_{2,2} & \cdots & a_{2,m} & 1 \\ \vdots & \vdots & & \vdots & \vdots \\ a_{n-1,1} & a_{n-1,2} & \cdots & a_{n-1,m} & 1 \end{pmatrix}.$$


Our main objective here is to contrast this result with some negative ones, showing that
divergence may occur for some functions in $L^p$, with $p$ sufficiently close to 1, due to the
maximal operator above failing to be bounded. One of the important aspects of these
counterexamples is that they explain why the time-frequency methods used
in [5] cannot be used all the way up to $L^1$.

One can also look at the averages in (1) from a different angle and analyze their real
variable analog

$$\frac{1}{\epsilon^m} \int_{|t_1|,\dots,|t_m| < \epsilon} \prod_{i=1}^{n-1} f_i\Big(x + \sum_{j=1}^{m} a_{i,j} t_j\Big)\, dt_1 \cdots dt_m, \qquad (4)$$

where $f_1, \dots, f_{n-1}$ are arbitrary measurable functions on $\mathbb{R}$. The question of norm
convergence for bounded functions is trivial in this context (one uses approximation and the
immediate result for the span of the characteristic functions of intervals). However, the
study of the pointwise convergence of the averages in (1) is intimately connected to that
of the averages in (4). Indeed, a maximal inequality (or the failure of it) in one setting
is equivalent to the existence (or failure) of a maximal inequality for the corresponding
maximal operator in the other context. The transition back and forth between the reals
and a dynamical system is mediated by the integers, which can be regarded as a copy of
each individual orbit sitting naturally inside the reals. We will do all the constructions in
the real variable setting and then transfer them to the averages in (1).

More interestingly, our counterexamples also apply with no modification to the case of 
the corresponding singular integral operators. In particular, we show that the trilinear 
Hilbert transform is unbounded in some range of exponents. 

Most of the results from the following section have been announced in [5]. 

2. Main results 

We start with the divergence of Furstenberg's nonconventional averages (2) for functions
in $L^p$ spaces, with $p$ close to 1. We will then sketch the more delicate argument for the
averages on cubes.

Theorem 2.1. Define $p_0 = 1 + \frac{\log_6 2}{1+\log_6 2}$ and consider $p < p_0$. In every ergodic dynamical
system $X = (X, \Sigma, \mu, T)$ there are three functions $F, G, H \in L^p(X)$ such that

$$\limsup_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} F(T^{n}x)\, G(T^{2n}x)\, H(T^{3n}x) = \infty,$$

for $\mu$ a.e. $x \in X$.

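Proof. The idea of the proof is to construct "small" subsets of integers $A_0$ and $B_0$ such
that $2B_0 - A_0$ is also "small" while $2A_0 - B_0$ is "large". This choice is dictated by the
geometry of the averages under investigation, as explained below.
For a fixed $k \ge 1$ consider the following subsets of $[-1, 1]$:

$$A = \Big\{\sum_{i=1}^{k} \frac{a_i}{12^i} + z : a_i \in A_0 := \{-4,-2,0\},\ 0 < z < \frac{1}{2\times 12^k}\Big\}$$

$$B = \Big\{\sum_{i=1}^{k} \frac{b_i}{12^i} + z : b_i \in B_0 := \{0,1,2,3\},\ 0 < z < \frac{1}{2\times 12^k}\Big\}$$

$$C = \Big\{\sum_{i=1}^{k} \frac{2b_i - a_i}{12^i} + z : a_i \in \{-4,-2,0\},\ b_i \in \{0,1,2,3\},\ 0 < z < \frac{1}{2\times 12^k}\Big\}$$
$$\phantom{C} = \Big\{\sum_{i=1}^{k} \frac{c_i}{12^i} + z : c_i \in \{0,2,4,6,8,10\},\ 0 < z < \frac{1}{2\times 12^k}\Big\}$$

$$D = \Big\{\sum_{i=1}^{k} \frac{2a_i - b_i}{12^i} + z : a_i \in \{-4,-2,0\},\ b_i \in \{0,1,2,3\},\ 0 < z < \frac{1}{8\times 12^k}\Big\}$$
$$\phantom{D} = \Big\{\sum_{i=1}^{k} \frac{d_i}{12^i} + z : d_i \in \{-11,-10,\dots,-1,0\},\ 0 < z < \frac{1}{8\times 12^k}\Big\}.$$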
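The digit arithmetic behind this choice can be checked mechanically. The following is a minimal Python sketch (the helper name `combo_digits` is ours and purely illustrative) that enumerates the digits $2b - a$ and $2a - b$ for $a \in \{-4,-2,0\}$ and $b \in \{0,1,2,3\}$. It confirms that the first combination takes only the six even values $0,2,\dots,10$, while the second takes each of the twelve values $-11,\dots,0$ exactly once, so the pair $(a_i, b_i)$ can be read off from each digit $d_i$ of a point of $D$.

```python
from itertools import product

A0 = (-4, -2, 0)    # digit set used for A in the text
B0 = (0, 1, 2, 3)   # digit set used for B in the text

def combo_digits(alpha, beta):
    """Sorted list (with multiplicity) of alpha*a + beta*b over a in A0, b in B0."""
    return sorted(alpha * a + beta * b for a, b in product(A0, B0))

# Digits of 2*B0 - A0 (the "small" combination): duplicates are collapsed.
c_digits = sorted(set(combo_digits(-1, 2)))

# Digits of 2*A0 - B0 (the "large" combination): kept with multiplicity,
# so that a repeated value would show up as a repeated list entry.
d_digits = combo_digits(2, -1)

print("2B0 - A0:", c_digits)
print("2A0 - B0:", d_digits)
```

Because the twelve digits of $2A_0 - B_0$ fill a complete residue system modulo 12 without repetition, the base-12 expansions with these digits cover a set of measure comparable to 1, whereas $A$, $B$ and $C$ each use only 3, 4 and 6 of the 12 possible digits.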
We claim that

$$m\Big\{x \in [-1,0] : \int_0^1 1_A(x+t)\, 1_B(x+2t)\, 1_C(x+3t)\, dt \ge \frac{1}{8\times 12^k}\Big\} \ge \frac{1}{8}, \qquad (5)$$

where $m$ denotes the Lebesgue measure. Indeed, for each $x \in D$ of the form $x =
\sum_{i=1}^{k} \frac{2a_i - b_i}{12^i} + z$ (for some $a_i$, $b_i$) with $0 < z < \frac{1}{8\times 12^k}$, and for each $t$ of the form $t =
\sum_{i=1}^{k} \frac{b_i - a_i}{12^i} + z'$ (with the same $a_i$, $b_i$), with $0 < z' < \frac{1}{8\times 12^k}$, we immediately see that $x + t \in A$,
$x + 2t \in B$ and $x + 3t \in C$. It now suffices to note that $m(D) = \frac{1}{8}$, and the claim follows.
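From here we deduce, by the definition of Riemann integral, that for some sufficiently
large $N_k$ (independent of $x$) we have

$$m\Big\{x \in [-1,0] : \frac{1}{N_k} \sum_{n=1}^{N_k} 1_A\Big(x + \frac{n}{N_k}\Big)\, 1_B\Big(x + \frac{2n}{N_k}\Big)\, 1_C\Big(x + \frac{3n}{N_k}\Big) \ge \frac{1}{16\times 12^k}\Big\} \ge \frac{1}{8}. \qquad (6)$$
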

Now consider a $p < p_0$ and the dynamical system $X_k = ([-1,1], \Sigma, m_1, T_k)$, where $\Sigma$
is the restriction of the Lebesgue algebra, $m_1$ is the normalized Lebesgue measure (i.e.
$m_1([-1,1]) = 1$) and $T_k(x) = x + \frac{1}{N_k}$, with the addition considered modulo the interval
$[-1,1]$. We can now rephrase (6) as

$$m_1\Big\{x \in [-1,1] : \sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} 1_A(T_k^{n}x)\, 1_B(T_k^{2n}x)\, 1_C(T_k^{3n}x) \ge \frac{1}{16\times 12^k}\Big\} \ge \frac{1}{16}. \qquad (7)$$
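A more useful way of stating this, upon noting that $\|1_A\|_{L^p(X_k)} = \big(\frac{1}{4\times 4^k}\big)^{1/p}$, $\|1_B\|_{L^p(X_k)} =
\big(\frac{1}{4\times 3^k}\big)^{1/p}$ and $\|1_C\|_{L^p(X_k)} = \big(\frac{1}{4\times 2^k}\big)^{1/p}$, is

$$\sup_{\substack{\|f\|_{L^p(X_k)} = (\frac{1}{4\times 4^k})^{1/p} \\ \|g\|_{L^p(X_k)} = (\frac{1}{4\times 3^k})^{1/p} \\ \|h\|_{L^p(X_k)} = (\frac{1}{4\times 2^k})^{1/p}}} m_1\Big\{x \in [-1,1] : \sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} f(T_k^{n}x)\, g(T_k^{2n}x)\, h(T_k^{3n}x) > \frac{1}{32\times 12^k}\Big\} \ge \frac{1}{16}.$$
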



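By using the transference principle proved in the Appendix, it follows that there exist
$f_k \in L^p(X)$, $g_k \in L^p(X)$ and $h_k \in L^p(X)$ with $\|f_k\|_{L^p(X)} = \big(\frac{1}{4\times 4^k}\big)^{1/p}$, $\|g_k\|_{L^p(X)} =
\big(\frac{1}{4\times 3^k}\big)^{1/p}$, $\|h_k\|_{L^p(X)} = \big(\frac{1}{4\times 2^k}\big)^{1/p}$, such that

$$\mu\Big(E_k := \Big\{x \in X : \sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} f_k(T^{n}x)\, g_k(T^{2n}x)\, h_k(T^{3n}x) > \frac{1}{32\times 12^k}\Big\}\Big) \ge \frac{1}{16}. \qquad (8)$$

Define now

$$f = \sum_{k=1}^{\infty} \frac{(4\times 4^k)^{1/p}}{k^2}\, f_k,$$

$$g = \sum_{k=1}^{\infty} \frac{(4\times 3^k)^{1/p}}{k^2}\, g_k,$$

$$h = \sum_{k=1}^{\infty} \frac{(4\times 2^k)^{1/p}}{k^2}\, h_k.$$

Note that $f, g, h \in L^p(X)$. Moreover, there exists a set $X_0 \subset X$ of positive measure such
that each $x$ from $X_0$ belongs to infinitely many $E_k$'s. For each such $x$ and $k$ we have

$$\sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} f(T^{n}x)\, g(T^{2n}x)\, h(T^{3n}x) \ge \frac{(4\times 4^k)^{1/p}\,(4\times 3^k)^{1/p}\,(4\times 2^k)^{1/p}}{32\, k^6 \times 12^k}.$$

Due to our choice of $p$, the lower bound on the right goes to $\infty$ as $k \to \infty$. Since

$$\bigcup_{j=1}^{\infty} T^{j}(X_0) = X$$

up to sets of $\mu$ measure 0, it follows that

$$F := \sum_{j=1}^{\infty} \frac{1}{j^2}\, T^{j} f,$$

$$G := \sum_{j=1}^{\infty} \frac{1}{j^2}\, T^{j} g,$$

$$H := \sum_{j=1}^{\infty} \frac{1}{j^2}\, T^{j} h$$

satisfy the requirement of Theorem 2.1. ■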
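The role of the threshold exponent can be illustrated numerically. The sketch below is only a sanity check, not part of the proof; the function name `log_lower_bound` is ours. It evaluates the logarithm of the quantity $(4\times 4^k)^{1/p}(4\times 3^k)^{1/p}(4\times 2^k)^{1/p} / (32 k^6 \times 12^k)$ and compares the exponent $1 + \frac{\log_6 2}{1+\log_6 2}$ with $\log_{12} 24$, the value at which $24^{1/p}$ and $12$ balance.

```python
import math

# Threshold exponent as written in the statement of Theorem 2.1.
p0 = 1 + math.log(2, 6) / (1 + math.log(2, 6))

def log_lower_bound(p: float, k: int) -> float:
    """Natural log of (4*4^k)^(1/p) (4*3^k)^(1/p) (4*2^k)^(1/p) / (32 k^6 12^k).

    Working with logarithms avoids overflow for large k.
    """
    log_numerator = (3 * math.log(4) + k * math.log(24)) / p
    log_denominator = math.log(32) + 6 * math.log(k) + k * math.log(12)
    return log_numerator - log_denominator

print(f"p0 = {p0:.6f}, log_12(24) = {math.log(24, 12):.6f}")
for p in (1.2, 1.3):
    values = [round(log_lower_bound(p, k), 1) for k in (100, 1000, 10000)]
    print(f"p = {p}: log of the bound at k = 100, 1000, 10000 -> {values}")
```

For an exponent below the threshold the logarithm eventually grows linearly in $k$ (the polynomial factor $k^6$ only delays this), while for an exponent above the threshold it decreases linearly.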

Remark 2.2. No negative result can be proved with these techniques for the bilinear
averages, the reason being that $x + t$ and $x + 2t$ are linearly independent monomials in
$\mathbb{R}[x,t]$. The result of Lacey [12] shows that they do behave well for some range of indices;
however, the opposite behavior is anticipated near $L^1$.

Remark 2.3. The index $p_0$ in the above theorem can be pushed as close as desired to $\frac{3}{2}$,
if one considers instead the more general trilinear ergodic averages

$$\frac{1}{N} \sum_{n=1}^{N} F(T^{a_1 n}x)\, G(T^{a_2 n}x)\, H(T^{a_3 n}x),$$

with $a_1 \ne a_2 \ne a_3 \in \mathbb{Z}$ (the choice for $a_j$ will depend on how close $p_0$ is to $\frac{3}{2}$). This
follows as a consequence of the results obtained by Christ in [4], which were also the main
inspiration for our investigation here. On the other hand, it is a consequence of Lacey's
result [12] and multilinear interpolation, as explained in [5], that $p_0$ can be at most 2.

Remark 2.4. The result in Theorem 2.1 immediately proves the divergence of Furstenberg's
averages for any $n > 3$ (just choose the remaining functions to be identically equal
to one).

Interestingly, the constructions above also prove the unboundedness of the trilinear
Hilbert transform. This operator is initially defined for piecewise continuous functions $f_i$
with finite support as follows:

$$H_3(f_1, f_2, f_3)(x) = \mathrm{p.v.} \int f_1(x+t)\, f_2(x+2t)\, f_3(x+3t)\, \frac{dt}{t}.$$

The bilinear version $H_2$ of this, also known as the bilinear Hilbert transform,

$$H_2(f_1, f_2)(x) = \mathrm{p.v.} \int f_1(x+t)\, f_2(x+2t)\, \frac{dt}{t},$$

was proved to be bounded by Lacey and Thiele [10], [11] in some range of exponents.
The type of time-frequency analysis involved in their argument seems insufficient at the
moment to address any positive bounds for $H_3$.

Theorem 2.5. Define $p_0 = 1 + \frac{\log_6 2}{1+\log_6 2}$ and consider $p < p_0$. Then the inequality

$$\|H_3(f_1, f_2, f_3)\|_{p/3} \le C\, \|f_1\|_p\, \|f_2\|_p\, \|f_3\|_p \qquad (9)$$

fails to hold with a universal constant $C$, independent of $f_j$.

Proof. It suffices to note that with the notation in the proof of Theorem 2.1 we have the
following consequence of (5):

$$m\Big\{x \in [-1,0] : \int_0^{\infty} 1_A(x+t)\, 1_B(x+2t)\, 1_C(x+3t)\, \frac{dt}{t} \ge \frac{1}{8\times 12^k}\Big\} \ge \frac{1}{8}.$$

To see this, observe that since $x$ is negative and since $B$ consists only of positive numbers
we have

$$H_3(1_A, 1_B, 1_C)(x) = \int_0^{\infty} 1_A(x+t)\, 1_B(x+2t)\, 1_C(x+3t)\, \frac{dt}{t} \ge \int_0^{1} 1_A(x+t)\, 1_B(x+2t)\, 1_C(x+3t)\, dt$$

for each $x \in [-1,0]$. Note also that as before $\|1_A\|_{L^p} = \big(\frac{1}{4\times 4^k}\big)^{1/p}$, $\|1_B\|_{L^p} = \big(\frac{1}{4\times 3^k}\big)^{1/p}$,
$\|1_C\|_{L^p} = \big(\frac{1}{4\times 2^k}\big)^{1/p}$, and $\|H_3(1_A, 1_B, 1_C)\|_{p/3} \ge \frac{1}{8^{3/p}} \cdot \frac{1}{8\times 12^k}$. By using $f_1 = 1_A$, $f_2 = 1_B$,
$f_3 = 1_C$, a simple computation now shows that the constant $C$ in (9) will go to $\infty$ as $k$
goes to $\infty$.

■

The following analog of Theorem 2.1 holds for the averages on cubes (3). 

Theorem 2.6. For each $m \ge 3$, for each $p < \frac{2^{m-1}+1}{m+1}$, given any ergodic dynamical system
$X = (X, \Sigma, \mu, T)$, there are $2^m - 1$ functions $F_\epsilon \in L^p(X)$, $\epsilon \in V_m$, satisfying

$$\limsup_{N\to\infty} K_m(F_{\epsilon_1}, \dots, F_{\epsilon_{2^m-1}}, N)(x) = \infty$$

for $\mu$ a.e. $x \in X$.

Proof As before, we first analyze the analog of the above averages on R, defined by 

K m (f ei , f €2m _ 1 ,\)(x) := J ... J Yl F ^ X + **• € ) dt ± ■ ■ ■ dt m, 

where $t = (t_1, \dots, t_m)$. Fix a $k \ge 1$. Define first



Ai,...,i j -i,o,i i+ i,.,i) = {J2 ( 2 ^fi)i : ai G {°' 2J ^ 
for each $1 \le j \le m$ (which will be denoted with $B_j$, for simplicity in future reference),
and also

k 

(which will be denoted with $B$, for simplicity in future reference). All sets $A_\epsilon$ are indexed
by vectors in $V_m$. We uniquely determine the remaining $A_\epsilon$'s as being the minimal sets
which satisfy the constraint: for each $x, t_1, \dots, t_m \in \mathbb{R}$ satisfying

$$x + t_1 + \dots + t_{j-1} + t_{j+1} + \dots + t_m \in A_{(1,\dots,1,0,1,\dots,1)} \qquad (10)$$

(with the $0$ in position $j$) for each $1 \le j \le m$, and

$$x + t_1 + \dots + t_m \in A_{(1,\dots,1)}, \qquad (11)$$

we also have that

$$x + t \cdot \epsilon \in A_\epsilon$$

for all the remaining $\epsilon \in V_m$. A simple inspection shows that the monomials in $x$ and $t$
from (10) and (11) are linearly independent. Also, for each $1 \le l \le m - 2$ and for each
$\epsilon \in V_m$ with exactly $l$ nonzero entries, $A_\epsilon$ can be written as a linear combination of exactly
$m + 1 - l$ of the $m + 1$ sets $A_\epsilon$ from (10) and (11). For example

$$A_{(1,0,\dots,0)} = B_2 + B_3 + \dots + B_m - (m-2)B,$$

$$A_{(1,1,0,\dots,0)} = B_3 + B_4 + \dots + B_m - (m-3)B, \quad \text{etc.}$$
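The two examples are instances of a general identity between linear forms, which we state here as our own reading of the pattern (it is not written out in this generality in the text): if $\epsilon$ has $l$ nonzero entries, then $x + t\cdot\epsilon$ equals the sum of the forms in (10) over those $j$ with $\epsilon_j = 0$, minus $(m - l - 1)$ times the form in (11). A short Python sketch, with the hypothetical helper names `decomposition_ok` and `all_ok`, checks this coefficient by coefficient.

```python
from itertools import product

def decomposition_ok(m, eps):
    """Check, as an identity of linear forms in (x, t_1, ..., t_m), that

        x + t.eps = sum over {j : eps_j = 0} of (x + sum_{i != j} t_i)
                    - (m - l - 1) * (x + t_1 + ... + t_m),

    where l is the number of nonzero entries of eps.
    Forms are stored as coefficient lists [coef of x, coef of t_1, ..., coef of t_m].
    """
    l = sum(eps)
    target = [1] + list(eps)
    total = [0] * (m + 1)
    for j in range(m):
        if eps[j] == 0:
            # the form from (10) with t_j omitted
            form = [1] + [0 if i == j else 1 for i in range(m)]
            total = [u + v for u, v in zip(total, form)]
    # subtract (m - l - 1) copies of the form from (11), whose coefficients are all 1
    total = [u - (m - l - 1) for u in total]
    return total == target

def all_ok(m):
    """Run the check for every nonzero eps in {0,1}^m."""
    return all(decomposition_ok(m, eps)
               for eps in product((0, 1), repeat=m) if any(eps))

print([all_ok(m) for m in range(2, 8)])
```

For $1 \le l \le m-2$ the identity involves $m - l$ of the sets $B_j$ together with $B$, that is, $m + 1 - l$ sets in total, in agreement with the count stated above.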

Since each $B_j$ and $B$ have $2^k$ elements, we conclude that for each $1 \le l \le m - 2$, $\binom{m}{l}$ of
the $A_\epsilon$'s will have at most $2^{(m-l+1)k}$ elements. For $l = m - 1$, we have $m$ such $A_\epsilon$ (i.e. the
$B_j$'s) with $2^k$ elements each, while for $l = m$ there exists one such set (namely $B$) with
$2^k$ elements. Note also that (10) and (11) imply that

$$x \in C := B_1 + \dots + B_m - (m-1)B \qquad (12)$$

$$t_j \in C_j := B - B_{j+1} \qquad (13)$$

for each $1 \le j \le m$, with the obvious agreement that $B_{m+1} := B_1$. Also, for each
$x = b_1 + \dots + b_m - (m-1)b \in C$ there exists a unique $m$-tuple $t = (t_1, \dots, t_m) \in \prod_{j=1}^{m} C_j$,
namely

$$(t_1, \dots, t_m) = (b - b_2,\ b - b_3,\ \dots,\ b - b_1), \qquad (14)$$

such that (10) and (11) hold.
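One can easily see that $C$
has $2^{k(m+1)}$ elements, and each two of them are separated by at least $2^{-k(m+1)}$. Define now
the sets

$$\tilde{A}_\epsilon = A_\epsilon + \big[0,\ 2^{-k(m+1)}\big]$$

for each $\epsilon \in V_m$ and

$$\tilde{C} = C + \Big[0,\ \frac{1}{(m+1)2^{k(m+1)}}\Big].$$

Take now some $x = b_1 + \dots + b_m - (m-1)b + z \in \tilde{C}$, with $0 \le z \le \frac{1}{(m+1)2^{k(m+1)}}$. The above
discussion shows that for each $t_j \in b - b_{j+1} + \big[0, \frac{1}{(m+1)2^{k(m+1)}}\big]$ we have that for each $\epsilon \in V_m$

$$x + t \cdot \epsilon \in \tilde{A}_\epsilon.$$

Define the functions

$$f_{k,\epsilon} = 1_{\tilde{A}_\epsilon}$$

and note that

$$\|f_{k,\epsilon}\|_{L^p(\mathbb{R})} \le \Big(\frac{|A_\epsilon|}{2^{k(m+1)}}\Big)^{1/p}$$

and also that

$$m\Big\{x : \int \cdots \int \prod_{\epsilon \in V_m} f_{k,\epsilon}(x + t\cdot\epsilon)\, dt_1 \cdots dt_m \ge \frac{1}{[(m+1)2^{k(m+1)}]^m}\Big\} \ge \frac{2^{k(m+1)}}{(m+1)2^{k(m+1)}} = \frac{1}{m+1}.$$
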



The argument continues like in Theorem 2.1, and hence we are guaranteed the negative
result as long as the sequence

$$\frac{1}{[(m+1)2^{k(m+1)}]^m} \prod_{\epsilon \in V_m} \Big(\frac{2^{k(m+1)}}{|A_\epsilon|}\Big)^{1/p}$$

diverges to $\infty$. Using the discussion on the sizes of the $A_\epsilon$'s from before, it suffices to have

$$\lim_{k\to\infty} \frac{\Big(\prod_{l=1}^{m-2} 2^{lk\binom{m}{l}} \cdot 2^{km(m+1)}\Big)^{1/p}}{[(m+1)2^{k(m+1)}]^m} = \infty.$$

This is easily seen to happen for $p < 1 + \frac{\sum_{l=1}^{m-2} l\binom{m}{l}}{m(m+1)} = \frac{2^{m-1}+1}{m+1}$. ■
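The closed form of this exponent rests on the binomial identity $\sum_{l=1}^{m} l\binom{m}{l} = m2^{m-1}$, from which the two top terms $l = m-1$ and $l = m$ are removed. The following Python sketch (the function names `cube_threshold` and `closed_form` are ours, chosen for illustration) compares both expressions in exact rational arithmetic for small $m$.

```python
from fractions import Fraction
from math import comb

def cube_threshold(m: int) -> Fraction:
    """1 + (sum_{l=1}^{m-2} l * C(m, l)) / (m (m + 1)), computed exactly."""
    s = sum(l * comb(m, l) for l in range(1, m - 1))  # l runs over 1, ..., m-2
    return 1 + Fraction(s, m * (m + 1))

def closed_form(m: int) -> Fraction:
    """(2^(m-1) + 1) / (m + 1)."""
    return Fraction(2 ** (m - 1) + 1, m + 1)

for m in range(3, 9):
    print(m, cube_threshold(m), closed_form(m))
```

For $m = 3$ both expressions equal $\frac{5}{4}$, and for $m = 2$ the sum is empty and the value is $1$, which is consistent with the construction giving no information for the averages on squares.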

Remark 2.7. Note that a negative result is produced as soon as one realizes that at least
one of those $A_\epsilon$'s for which $\epsilon$ has at most $m - 2$ nonzero entries has cardinality considerably
less than $2^{k(m+1)}$.

As in the case of the bilinear averages analyzed earlier, this type of construction cannot
prove divergence for the averages on squares ($m = 2$), but does offer valuable information
regarding their degenerate analog

$$\frac{1}{N^2} \sum_{i=1}^{N} \sum_{j=1}^{N} f(T^{2i}x)\, g(T^{2j}x)\, h(T^{i+j}x). \qquad (15)$$

We note first that as a result of multilinear interpolation, the maximal operator associated
with these averages maps boundedly $L^{p_1} \times L^{p_2} \times L^{p_3}$ into $L^{p_4'}$, where $\frac{1}{p_4'} = \frac{1}{p_1} + \frac{1}{p_2} + \frac{1}{p_3}$,
whenever $p_4' > \frac{1}{2}$ and $1 < p_1, p_2, p_3 \le \infty$. Indeed, the boundedness is immediate in the case
$p_1 = p_2 = 1 + \epsilon$, $p_3 = \infty$, while the other five permutations can be seen similarly, after
some change of variables. This is sharp as the following proposition shows.
Proposition 2.8. The maximal operator associated with the averages (15) fails to map
boundedly $L^{p_1} \times L^{p_2} \times L^{p_3}$ into $L^{p_4'}$, whenever $p_4' < \frac{1}{2}$.

Proof. To see this, it suffices to repeat the argument in Theorem 2.1 for $f = g = h = 1_A$
where $A := \{z \in \mathbb{R} : |z| < \frac{1}{M}\}$, for large $M$. The maximal inequality will have a constant
that will go to $\infty$ as $M$ goes to $\infty$. The details are left to the reader. ■

The above proposition combined with the positive result in [5] for the nondegenerate 
squares in the range p' 4 > |, makes the important point that the almost everywhere 
behavior on various $L^p$ spaces ($p < \infty$) of the averages

$$\frac{1}{N^2} \sum_{i=1}^{N} \sum_{j=1}^{N} f(T_1^{i}x)\, g(T_2^{j}x)\, h(T_3^{i+j}x)$$

associated with general commuting measure preserving transformations $T_1, T_2, T_3$, is very
sensitive to the relation between the $T_j$'s. This is unlike the case where $f, g, h \in L^\infty(X)$,
for which the a.e. convergence holds for any (not necessarily commuting) transformations
$T_1, T_2, T_3$ (see [1]).

The main ingredient behind all the negative results proved here is the fact that the
monomials $x + \sum_{j=1}^{l} a_{i,j} t_j$, $1 \le i \le n - 1$, are linearly dependent in $\mathbb{R}[x, t_1, \dots, t_l]$. This
allows for the functions involved to be simultaneously large, hence making the maximal
operator big on a relevant set of $x$'s. Similar constructions can be made for various other
averages of this type; however, it is not clear whether this approach can always be applied.
Here is a brief account of what the main difficulty is. Let $r$ be the smallest number for
which one can find $r$ linearly dependent monomials as above, and assume for simplicity
that these correspond to $i \in \{1, \dots, r\}$. Then we can always find at least $r - 2$ monomials
among $\sum_{j=1}^{l} a_{i,j} t_j$, $1 \le i \le r$, which are linearly independent in $\mathbb{R}[t_1, \dots, t_l]$. One type of
situation occurs when only $r - 2$ such monomials exist, say the ones with $i \in \{1, \dots, r - 2\}$.
By performing a suitable change of variables and by ignoring the influence of $f_i$, $i \ge r + 1$,
it suffices to show that the maximal operator

$$\sup_{\epsilon > 0} \frac{1}{\epsilon^{r-2}} \int_{|t_1|,\dots,|t_{r-2}| < \epsilon} \prod_{i=1}^{r-2} |f_i(x + t_i)| \cdot \Big|f_{r-1}\Big(x + \sum_{i=1}^{r-2} b_i t_i\Big)\Big|\, \Big|f_r\Big(x + \sum_{i=1}^{r-2} c_i t_i\Big)\Big|\, dt_1 \dots dt_{r-2}, \qquad (16)$$

fails to be bounded in some appropriate range of exponents. The coefficients $b_i$, $c_i$ are
arbitrary integers with the property that the monomial $x + \sum_{i=1}^{r-2} c_i t_i$ can be written as a
linear combination of $x + t_i$, $1 \le i \le r - 2$, and $x + \sum_{i=1}^{r-2} b_i t_i$. In this scenario, the
constructions would have to be sensitive to the arithmetic of the $b_i$'s and $c_i$'s. The
second type of situation occurs when we can find $r - 1$ linearly independent monomials in
$\mathbb{R}[t_1, \dots, t_l]$ (the degenerate case). A similar reasoning shows that in this case it suffices
to analyze the maximal operator

$$\sup_{\epsilon > 0} \frac{1}{\epsilon^{r-1}} \int_{|t_1|,\dots,|t_{r-1}| < \epsilon} \prod_{i=1}^{r-1} |f_i(x + t_i)| \cdot \Big|f_r\Big(x + \sum_{i=1}^{r-1} b_i t_i\Big)\Big|\, dt_1 \dots dt_{r-1}, \qquad (17)$$

where the coefficients $b_i$ are arbitrary integers with the property that the monomial $x +
\sum_{i=1}^{r-1} b_i t_i$ can be written as a linear combination of $x + t_i$, $1 \le i \le r - 1$. In this case, the
same construction as the one described above for the averages on degenerate squares
will produce divergence results for some $f_i \in L^p$, $1 \le i \le r$, whenever $p < \frac{r}{r-1}$.



3. Appendix 

For a complete nonatomic probability space $(X, \Sigma, \mu)$, denote by $\mathcal{C}(X)$ the family of
all the invertible $\mu$-measure preserving transformations $T$ of $X$. Equip $\mathcal{C}(X)$ with the
topology of weak convergence, in which $T_s \to T$ if and only if $\mu(T_s A\, \Delta\, TA) \to 0$ for each
$A \in \Sigma$. If a second complete nonatomic probability space $(Y, \mathcal{F}, \nu)$ is present, we will
denote by $\mathcal{C}(Y, X)$ the set of all invertible, bimeasurable transformations $\beta : Y \to X$
which take measure $\nu$ to measure $\mu$. The following result is due to Halmos [8], and is the
key to the proof of the transference Lemma 3.2:

Theorem 3.1. Given an ergodic dynamical system $Y = (Y, \mathcal{F}, \nu, S)$ and a complete
nonatomic probability space $(X, \Sigma, \mu)$, the set

$$\{\beta S \beta^{-1} : \beta \in \mathcal{C}(Y, X)\}$$

is dense in the weak topology of $\mathcal{C}(X)$.

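Lemma 3.2 (Transference Principle). For a given dynamical system $X_T = (X, \Sigma, \mu, T)$,
$p \ge 1$ and positive constants $a, b, c, \lambda$, define

$$\gamma(X_T, a, b, c, \lambda) = \sup_{\substack{\|f\|_{L^p(X)} = a \\ \|g\|_{L^p(X)} = b \\ \|h\|_{L^p(X)} = c}} \mu\Big\{x \in X : \sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} f(T^{n}x)\, g(T^{2n}x)\, h(T^{3n}x) > \lambda\Big\}.$$

Then for any ergodic dynamical system $Y_S = (Y, \mathcal{F}, \nu, S)$ we have

$$\gamma(X_T, a, b, c, \lambda) \le \gamma(Y_S, a, b, c, \lambda).$$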

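The first goal here is to show that for each $0 \le r \le 1$,

$$S_r = \{\sigma \in \mathcal{C}(X) : \gamma(X_\sigma, a, b, c, \lambda) \le r\}$$

is closed in the weak topology of $\mathcal{C}(X)$. Let $\sigma_s$ be a net in $\mathcal{C}(X)$ converging weakly
to some $\sigma$, such that $\gamma(X_{\sigma_s}, a, b, c, \lambda) \le r$. Take $f, g, h \in L^p(X_T)$ with $\|f\|_{L^p(X_T)} = a$,
$\|g\|_{L^p(X_T)} = b$ and $\|h\|_{L^p(X_T)} = c$. From the definition of the weak topology, $f \circ \sigma_s \to f \circ \sigma$
in the norm of $L^p(X_T)$ and similarly for $g$ and $h$. There exists a subnet indexed by $\mathbb{Z}_+$,
which we will denote by $(\sigma_i)$, such that $f \circ \sigma_i \to f \circ \sigma$, $g \circ \sigma_i \to g \circ \sigma$ and $h \circ \sigma_i \to h \circ \sigma$ in
the $L^p$-norm. Choose a subset $X_0 \subset X$ of full measure such that $\lim_{i\to\infty} f(\sigma_i x) = f(\sigma x)$,
and simultaneously for $g$ and $h$, for each $x \in X_0$. This is easily seen to imply that

$$\sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} f(\sigma^{n}x)\, g(\sigma^{2n}x)\, h(\sigma^{3n}x) \le \liminf_{i\to\infty}\, \sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} f(\sigma_i^{n}x)\, g(\sigma_i^{2n}x)\, h(\sigma_i^{3n}x)$$

for each $x \in X_0$. So

$$\mu\Big\{x \in X : \sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} f(\sigma^{n}x)\, g(\sigma^{2n}x)\, h(\sigma^{3n}x) > \lambda\Big\}$$

$$\le \mu\Big\{x \in X : \liminf_{i\to\infty}\, \sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} f(\sigma_i^{n}x)\, g(\sigma_i^{2n}x)\, h(\sigma_i^{3n}x) > \lambda\Big\}$$

$$\le \lim_{i_0\to\infty} \mu\Big\{x \in X : \inf_{i \ge i_0}\, \sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} f(\sigma_i^{n}x)\, g(\sigma_i^{2n}x)\, h(\sigma_i^{3n}x) > \lambda\Big\}$$

$$\le \liminf_{i\to\infty} \mu\Big\{x \in X : \sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} f(\sigma_i^{n}x)\, g(\sigma_i^{2n}x)\, h(\sigma_i^{3n}x) > \lambda\Big\}$$

$$\le r.$$

The fact that $S_r$ is closed follows immediately.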

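For the last part of the proof, consider an arbitrary system $X_T = (X, \Sigma, \mu, T)$. Take
arbitrary functions $f, g, h \in L^p(X_T)$ with $L^p$ norms equal to $a$, $b$ and $c$, respectively.
Theorem 3.1 guarantees the existence of a net of transformations $(\beta_s) \subset \mathcal{C}(Y, X)$ such
that $\beta_s S \beta_s^{-1} \to T$ in the weak topology. It is an easy verification that

$$\sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} (f \circ \beta_s)(S^{n}\beta_s^{-1}x)\, (g \circ \beta_s)(S^{2n}\beta_s^{-1}x)\, (h \circ \beta_s)(S^{3n}\beta_s^{-1}x)$$

$$= \sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} f((\beta_s S \beta_s^{-1})^{n}x)\, g((\beta_s S \beta_s^{-1})^{2n}x)\, h((\beta_s S \beta_s^{-1})^{3n}x)$$

for $\mu$ a.e. $x$. The functions $f \circ \beta_s$, $g \circ \beta_s$ and $h \circ \beta_s$ have the same norms $a$, $b$ and $c$
respectively in $L^p(Y_S)$, hence

$$\mu\Big\{x \in X : \sup_{N \ge 1} \frac{1}{N} \sum_{n=1}^{N} f((\beta_s S \beta_s^{-1})^{n}x)\, g((\beta_s S \beta_s^{-1})^{2n}x)\, h((\beta_s S \beta_s^{-1})^{3n}x) > \lambda\Big\} \le \gamma(Y_S, a, b, c, \lambda).$$

Since $f, g, h$ were arbitrary, we get that for each $s$,

$$\gamma(X_{\beta_s S \beta_s^{-1}}, a, b, c, \lambda) \le \gamma(Y_S, a, b, c, \lambda).$$

The fact that $S_{\gamma(Y_S, a, b, c, \lambda)}$ is closed in $\mathcal{C}(X)$ finishes the proof.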

Remark 3.3. The obvious analog of the above Transference Principle also holds, with no
essential modifications in the proof, for each of the averages in (1).